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Abstract 


Severe acute respiratory syndrome (SARS) is an emerging infectious disease associated with a novel coronavirus and causing 
worldwide outbreaks. SARS coronavirus (SARS-CoV) is an enveloped RNA virus, which contains several structural proteins. 
Among these proteins, spike (S) protein is responsible for binding to specific cellular receptors and is a major antigenic determinant, 
which induces neutralizing antibody. In order to analyze the antigenicity and receptor-binding ability of SARS-CoV S protein, we 
expressed the S protein in Escherichia coli using a pET expression vector. After the isopropyl-B-p-thiogalactoside induction, S 
protein was expressed in the soluble form and purified by nickel-affinity chromatography to homogeneity. The amount of S protein 
recovered was 0.20.3 mg/100 ml bacterial culture. The S protein was recognized by sera from SARS patients by ELISA and Western 
blot, which indicated that recombinant S protein retained its antigenicity. By biotinylated ELISA and Western blot using biotin- 
labeled S protein as the probe, we identified 130-kDa and 140-kDa proteins in Vero cells that might be the cellular receptors re- 
sponsible for SARS-CoV infection. Taken together, these results suggested that recombinant S protein exhibited the antigenicity and 


receptor-binding ability, and it could be a good candidate for further developing SARS vaccine and anti-SARS therapy. 
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Severe acute respiratory syndrome (SARS) is a newly 
emerging human disease, resulting globally in 774 deaths 
from 8098 probable cases [1]. A novel coronavirus has 
been identified as the etiological agent of SARS and 
designated as SARS coronavirus (SARS-CoV) after 
tests of causation according to Koch’s postulates, in- 
cluding monkey inoculation [2-4]. SARS-CoV can infect 
African green monkey kidney (Vero E6) cells and cause 
a similar disease in cynomolgus macaques (Macaca 
fascicularis) [2,3]. The full-length genome sequence of 
SARS-CoV has been elucidated within weeks after the 
identification of this novel pathogen [5,6]. SARS-CoV 
contains a single-stranded plus-sense RNA genome 
about 30kb in length that has a 5’-cap structure and a 
3'-polyadenylation tract. The genomic organization is 
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typical of coronaviruses, having five major open reading 
frames (ORFs) that encode the replicase polyproteins; 
the spike (S), envelope (E), and membrane (M) glyco- 
proteins; and the nucleocapsid protein (N) in the same 
order as those of other coronaviruses [5-7]. 

When coronaviruses enter cells, the 5’-region of viral 
genome is translated into a large polypeptide that is 
cleaved by viral-encoded proteases to release RNA-de- 
pendent RNA polymerase and adenosine triphospha- 
tase/helicase. These proteins, in turn, are responsible for 
replicating the viral genome as well as generating nested 
transcripts that are used in the synthesis of viral pro- 
teins. Viral membrane proteins, including S and M are 
inserted into the endoplasmic reticulum (ER), while 
RNA genome assembles with the N protein. This RNA- 
protein complex then associates with M proteins and 
buds into the lumen of the ER. The virus particles then 
migrate through the Golgi complex and exist in the cells 
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by exocytosis [7]. The first step in viral infection is the 
binding of viral proteins to certain cellular receptors. So 
far, the S protein of coronavirus is considered as the site 
of viral attachment to the host cells [8,9]. 

The S proteins of coronaviruses are large type I 
membrane glycoprotein projections from viral envelope 
[10]. S proteins are responsible for both binding to re- 
ceptors on host cells and for membrane fusion [11,12]. S 
proteins also contain important virus-neutralizing epi- 
topes that elicit neutralizing antibody in the host species 
[13,14]. Furthermore, mutations in this gene dramati- 
cally affect the virulence, pathogenesis, and host cell 
tropism [15-17]. These results suggested that S protein is 
a good candidate for vaccine because neutralizing anti- 
bodies are directed against S. Moreover, S protein is also 
a good target for antiviral therapies because blockade of 
binding of S protein to cellular receptor can prevent 
virus entry. Therefore, in this study, we expressed the 
SARS-CoV S protein in Escherichia coli (E. coli). The 
antigenicity and receptor-binding ability of recombinant 
S protein were further analyzed. 


Materials and methods 


Construction of recombinant plasmids. SARS-CoV RNA was pro- 
vided by Department of Medical Technology, China Medical University 
Hospital. SARS-CoV S gene was kindly provided by Dr. P.J. Chen. 
Briefly, the RNA was reverse transcribed into cDNA using SARS-M2 
primer (5’/-GGAATTCGCCAAACATACCAAGGCC-3’) and Super- 
Script II (Invitrogen) according to the manufacturer’s protocol. The 
3580-bp DNA fragment was then amplified from the cDNA template by 
polymerase chain reaction (PCR) using PfuUlter DNA polymerase 
(Stratagene) with SARS-P1 primer (5/-CGGGATCCTAGTGGTAGT 
GACCTTGACC-3’) and SARS-M2 primer. The PCR profile was as 
follows: one cycle at 95 °C for 2 min; 30 cycles at 95 °C for 30s, 55 °C for 
30s, and 72 °C for 2 min. The amplified S cDNA fragments were cleaved 
by EcoRI and BamHI, and the 3580-bp fragments were then inserted into 
pET-28b(+) (Novagen) to create the pET-spike expression plasmid. 


A 5 kb 10 kb 


DNA sequencing. DNA sequencing was performed on double- 
stranded plasmids by dideoxy chain termination with the BigDye Ter- 
minator Cycle Sequencing Ready Reaction kit (Applied Biosystems) and 
analyzed by ABP Prism 310 (Applied Biosystems). Dimethyl sulfoxide 
was introduced to the reaction to a final concentration of 5% to diminish 
the secondary structure. The primer used in the sequencing reaction was 
T7, SARS-P2 (5’-CGGGATCCTAATTACACCTGGAACAAATGC- 
3’), T7 terminator, SARS-M2, SARS-M3 (5/-TTGATATAGAACAGC 
AACTTCAGATGAAGC-3’), or SARS-M1 (5'-GGAATTCCACTTA 
CACCCCCAAAAGAGC-3’). DNA was sequenced on both strands of 
at least two repeats of cloned DNA fragments. 

Expression and purification of recombinant SARS-CoV S protein. 
Recombinant protein was expressed in E. coli BL21(DE3)pLysS strain 
by transforming the pET-spike to produce an N-terminal fusion with 
six histidine residues. The expression and purification of recombinant S 
protein were performed as described previously with modification [18]. 
Briefly, cells were grown in 100ml Luria-Bertani broth agitated at 
37°C until OD¢oo reached 0.6. Isopropyl-B-p-thiogalactoside (IPTG) 
was then added to a final concentration of 0.5mM and the cells were 
pelleted 2h after induction. The cell pellet was washed twice with ice- 
cold binding buffer (20mM Tris-HCl, pH 7.9, 500mM NaCl, and 
5mM _ imidazole) and resuspended in 5ml ice-cold binding buffer 
containing 8M urea. The resuspended pellet was incubated at 4°C 
with shaking for 2h. After sonication, the suspension was centrifuged 
at 16,000g for 30 min at 4°C. The resulting supernatant was applied on 
nickel-affinity chromatography with 8M urea present throughout the 
procedure. The protein in the final column eluate was dialyzed over- 
night against phosphate buffered saline (PBS) (137 mM NaCl, 2.7mM 
KCl, 4.3mM Na,HPO,, and 1.4mM KH>,PO,) and stored at —70°C 
until further analysis. Protein was analyzed by sodium dodecyl sulfate— 
polyacrylamide gel electrophoresis (SDS-PAGE) and was quantified 
with a Bradford assay (Bio-Rad). 

Cells and cell extracts. Human lung carcinoma cell (A-549), human 
liver cell (Chang liver), African green monkey kidney cells (Vero E6, 
Vero), and human monocyte (U-397) were purchased from Bioresources 
Collection and Research Center (Hsinchu, Taiwan). Cells were main- 
tained in Dulbecco’s modified Eagle’s medium supplemented with 10% 
fetal bovine serum at 37°C and 5% CO) in air. The cell extracts were 
prepared by washing the cells once with ice-cold PBS and lysing the cells 
by lysis buffer (150 mM NaCl, 1% NP-40, 50 mM Tris-HCl, pH 8.0, and 
20 uM phenylmethylsulfonyl fluoride). After a 30-min-incubation on ice, 
the cell lysate was collected by centrifugation. Protein was quantified 
with a Bradford assay and stored at —30°C until further analysis. 
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Fig. 1. Genome organization of SARS-CoV and putative region of SARS-CoV S protein. (A) Overall organization of the 29,727-nt SARS-CoV. 
Predicted ORF la and ORF 1b encode replicase 1A and 1B, respectively. S, E, M, and N genes encode spike, envelope, membrane, and nucleocapsid 
proteins, respectively. S gene is situated at 21,492—25,259 nucleotide in the SARS-CoV genome. (B) Predicted functional regions of SARS-CoV S 
protein. The upper diagram represents the 1255-residue S protein. Regions spanning 1-13 and 1196-1255 are putative signal peptide, and trans- 
membrane and cytoplasmic regions, respectively. Putative receptor-binding site is located at residue 757-761. 
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Fig. 2. Multiple alignment of coronavirus S proteins. Amino acid sequences of S proteins from human respiratory coronavirus (HCoV-229E) 
(GenBank Accession No. P15423), porcine transmissible gastroenteritis virus (TGEV) (GenBank Accession No. CAA37285), feline infectious 
peritonitis virus (FIPV) (GenBank Accession No. P10033), human respiratory coronavirus (HCoV-OC43) (GenBank Accession No. AAA03055), 
murine hepatitis virus (MHV) (GenBank Accession no. AAC56567), porcine hemagglutinating coronavirus (HEV) (GenBank Accession No. 
AAM77000), and bovine coronavirus (BCoV) (GenBank Accession No. AAF25499) were aligned with those of SARS-CoV by PileUp program. 
Residues that are similar in all coronaviruses are shown in uppercase. The locations of signal peptide, and transmembrane and cytoplasmic regions of 
SARS-CoV S protein are underlined. The putative receptor-binding site of SARS-CoV S protein and antigenic sites of TGEV are shown in bold. 


T.-Y. Ho et al. | Biochemical and Biophysical Research Communications 313 (2004) 938-947 941 


631 720 
SARS-CoV ........ COVES ccisrareis ajenmimlerar eee’ cadre ereraeiate, yet rerae eve yao yRvv...Vls felLna..pa tvcgpkl... stD...likn 
HCoV-229E ...... sasI ntGneptsig ‘kvanfvkégs vCfslkdipg gcampivanw ayskyytIgs lyvswsdgdg itgvpqpveg vssfmnvtld 


TGEV tdvldatavI ktGtcpfsfd klnnyltfnk fClslspvga nckfDvaar. tRandqvVrs lyvIyeegdn ivgvpsdnsg lhDlsvlhld 
FIPV tdvlEatavI ktGtcpfsfd klnnyltfnk fClslspvga nckfDvaar. tRtneqvVrs lyvIyeegdn ivgvpsdnsg lhDlsvlhid 
HCoV-0C43 ..cpQtkslV giGehcsgla vksdycgg.n sCtcrpgqafl gwsaDsclqg ltcstdlqka ntD...iilg vCvnYdLYGI 
MHV ..clQarsmL gvGdhceglg vledkcggsn tCncsahafv gwakDsclan gRch...Ifs nlmLnginsg ttcsmdlqlp ntE...vvtg 
HEV ..cpQskvsI qpGqghcpglg lveddcsg.n pCtckpgafi gwssEtclqn gRcn...Ifa nfiLndvnsg ttcstdlqqg ntn...ittd 
BCoV ..cpQtkylvV giGehcsgla iksdycgg.n pCtcqpgqafl gwsvDsclqg dRcn...Ifa nfiLhdvnsg ttcstdlqks ntD...iilg 
Consensus ----Q----I --G------- anne -C-------- ----D----- -R-----I-- ---L------ ---------- --D------- 


721 800 

SARS-CoV gCvnFnfnGL tGtGVltp.s skrfqpfqqf grdv.sdftd svRDpktsei ldIspCsfgg vsvitpgtna ssevavlYqd vnctdVstai 
HCoV-229E kCtkYnIYdV sGvGVirvsN dTfln...gI tYtStsGnLl GFKDvtkgti YsItpCnppd qlvvyqqavv gamlsenFts ygfsnVve.. 
TGEV sCtdynIyGr sGvGlirgtN rTlls...gL yYtSlsGdLl GFKnvsdgvi YsVtpCdvsa qaAvidgtiv gaitsInsel Lglthwtt.. 

FIPV sCtdynIyGr tGvGlIirrtN sTlls...gL yYtSlsGdLl GFKnvsdgvi YsVtpCdvsa qaAvidgaiv gamtsInsel Lglthwtt.. 
HCoV-0C43 1GqGIfvevN aTyynswqnL lydS.nGnLy GFRDyiinrt FmIrsCysgr vsAafhan.. ssepaLlFrn IkcnyVfnns ltrqlqPi.. 
MHV vCvkYdLYGI tGqGIfkevk adyyhswqnL lydv.nGnLi GFRDfvanks YtIrsCysgr vsAayhqd.. apepaLlyrn LkcdyVfnnn 

HEV vCvnYdLYGI tGqGIlievN aTyynswqnL lydS.sGnLy GFRDylsnrt FlIrsCysgr vsAvfhan.. ssepalmFrn LkcshVfnyt 

BCoV vCvnYdLYGI tGqGIfvevN aTyynswqnL lydS.nGnLy GFRDyltnrt FmIrsCysgr vsAafhan.. ssepaLlFrn IkcnyVfnnt 


Consensus -C--Y-LYGI -G-GI----N -T------- L -Y-S--G-L- GFRD------ Y-I--C---- --A------- -----L-F-- L----V---- 
801 890 

SARS-CoV hadqltPawr iystgnnvFq tqaGclIgae hvdts..yec dipigaGiCa syhtvsllr. ..stsqk..s ivaytmslga DSsiaysn.n 
HCoV-229E ..... 1Pkff£f yas..ngtYn ct........ ics ORValy, BoosBEGICAdgeliavds. cxscrssce's ....prnvsy DSVsaIv.ta 
TGEV occies tPnfy yysiynytnD mtrGtaIdsn dvdcepViTy s...niGvCk ngalvfin.. .......... ...-vthsdg D.VQpIs.tg 

FIPV sicee tPnfy yysiynytsE rtrGtaIdsn dvdcepViTy s...niGvCk ngalvfin.. .......... +...vthsdg D.VOpIs.tg 
HCoV-O0C43 ...... nyFD sylGcvVnay nstaisVqTc dltvgsGyCv dysknrrsrg aittgyrftn fepftvnsvn DS1EpVggly eIqIPseFTI 
MHV isreetPl.. ...... nyFD sylGcvvnad nsteeaVdac dlrmgsGlCv nystshrars svstgykltt fepftvrivn DSVEsVdgly 

HRV TIPE Lie, winsey00 nyFD sylGcvVnay nntasaVsTc dltvgsGyCv dyvtalrsrr sfttgyrftn fepfaanlvn DSIEpVggly 

BCoY lerqligqPs.s <iceur nyFD sylGcvVnad nstssaVqTc dltvgsGyCv dystkrrsrr aittgyrftn fepftvnsvn DS1EpVggly 
Consensus ------ Pe-- -------- FD ---G--V--- ------ V-T- ------ G-C- ---------- ---------- ---------- DSVE-V---- 
891 980 


SARS-CoV tIaIPtnFsI sittEvmpvs maktsvDCnm yICgdsteCa nlLlqYgSfC tqLnraLsgi aaeqdrntrE vfaqvkqmyk tptlkyfG.. 
HCoV-229E nLsIPsnwIt svqvEYLQit stpIvVDCst yVCngnvrCv elLkqytSaC ktIedaLrns arLesadvsE mltfdkkaft LanvssfGd. 
TGEV nVtIPtnFTI svqvEYIQvy ttpVsIDCsr yVCngnprCn klLtqYvSaC qtIeqaLavg arLenmevds mlfvsenalk Lasveafnss 

FIPV nVtIPtnFTI svqvEYmQvy ttpVsIDCar yVCngnprCn klLtqYvSaC qtIeqaLamg arLenmevds mlfvsenalk Lasveafnst 
HCoV-0C43 gnmvEFIQts spkVvtIDCaa fvCgdyaaCk sqLveYgSfC DnInaiLtev neLldttqlQ vanslmngvt LstklkdGvn fnvDdI..nf 
MHV eLqIPtnFTI ashgEFVQtr spkVtIDCaa fvCgghtaCr gqqLveYgSfC DnInaiLgev nnLidtmqlQ vasaligqgvt LssrlsdGig 

HEV eIqIPseFTI gnleEFIQts spkVtIDCat fvCgdyaaCr qqLaeYgSfC EnInaiLiev neLldttqlQ vanslmogvt LstkikdGin 

BCoV eIqIPseFTI gnmeEFIQis spkVtIDCsa fVCgdyaaCk sqLveYgSfC DnInaiLtev neLldttqlQ vanslmngvt LstklkdGvn 


Consensus -I-IP--FTI ----EFIQ-- ---V-IDC-- -VC-----C- --L--Y¥-S-C D-I---L--- --L------Q ---------- L------ G-- 
981 1070 

SARS-CoV ....gf..nf sqil...... ...pdplkpt k...RSfIED LLFnKVtlad aGfimkq.Yge ClGdinarDL ICaQkFNGLt VLPplltddm 
HCoV-229E ynlssVi... ps........ ....lptsgS rvagRSAIED ILFsKlvtSg 1GtVdadykk CTkGlsIaDL aCaQyYNGIm VLPgvadaer 


TGEV etlDpIykew pnigGswleg lkyilpsdnS krkyRSAIED LLFsKVvtSg 1GtVdedykr CTGGyDIaDL VCaQyYNGIm VLPgvanadk 
FIPV enlDpIykew psigGswlgg lkdilpshnS krkygSAIED LLFdKVvtSg 1GtVdedykr CTGGyDIaDL VCaQyYNGIm VLPgvanadk 


HCoV-0C43 spvlGclgse cska..... S s...RSAIED LLFdKVk1Sd vGfVea.Ynn CTGGaEIrDL ICvQsYkGIk VLPpllseng isgYTlaats 
MHV gqiDdI..nf spllGclgsd cgevtmaagt g...RSAIED vLFdKVk1Sd vGfVea.Ynn CTGGgEVrDL LCvQsFNGIk VLPpvlseng 

HEV fnvDdI..nf ssvlGclgse cnra.....S t...RSAIED LLFdKVk1Sd vGfVqa.Ynn CTGGaEIrDL ICvQsYNGIk VLPpllsenq 

BCov fnvDdI..nf spvlGclgsd cnkv..... S s...RSAIED LLFsKVk1Sd vGfVea.Ynn CTGGaEIrDL ICvQsYNGIk VLPpllsenq 
Consensus ---D-I---- ----G----- --------- S ----RSAIED LLF-KV--S- -G-V---Y-- CTGG-EI-DL IC-Q-YNGI- VLP------- 
1071 1160 

SARS-CoV iaaYTaalvs gtatagwtfg aGaalqiPFa MqmayRfNgi gvtqnVLyeN QOKqIAngFNr AisqIgEslt tts....... ....... tAL 


HCoV-229E mamYTgslig gial...ggl .tsavsiPFs LaIQaRLNyv alqtDVLqeN QKiLAasFNk AmtnIvDaFt gvNdaitqts galqtvatAL 
TGEV mtmYTaslag gitl...gAl gGgavaiPFa vaVQaRLNyv alqtDVLnkN QqiLAsAFNq AignItQsFg kvNdaihqts rglatvakAL 
FIPV mtmYTaslag gitl...gAl gGgavaiPFa vaVQaRLNyv alqtDVLnkN QqiLAnAFNq AignItQaFg kvNdaihgts qglatvakAL 


HCoV-0C43 aslfppwtAa aG....vPFy LnVQyRINgl gvtmDVLsqN QKIIAnAFNn AlyaIgEgFd atN....... wsr.eee sAL vKIQaVVNan. 
MHV isgYTagatv samf.pwsAa aG....vPFs LsVQyRINgl gvtmnVLseN QKmIASAFNn AigaIgEgFa atN....... ....... SAL 

HEV isgYTsaata aslfppwtAa aG....vPFy LnVOyRINgl gvtmDVLsqN QK1IASAFNn AldsIqEgFd atN....... ....... sAL 

BCoV isgYTlaats aslfppwsAa aG....vPFy LnVQyRINgi gvtmDVLsqN QKI1IAnAFNn AldaIqEgFd atN....... .....+. sAL 
Consensus ---YT----- -------- A- -G----- PF- L-VQ-RIN-- ---- DVL--N QK-IA-AFN- AI--I-E-F- --N------- --------AL 
1161 1250 


SARS-CoV gKLOGVVNqn aQALNtLvkQ LssnFgAISs vLnDI1sRLD kvEAEvQIDR LItGRLqslLq tYvtQgLira aeIraSanlA atKmsECVlg 
HCoV-229E nKIQGVVNaq gnsLNhLtsQ LrqnFqAISs SIqaIydRLD tIQADQOVDR LItGRLaALN vFVShtLtky teVraSrqlA qOKVNECVKS 
TGEV aKVOdVVNtq gQALshLtvQ LqNnFqAISs SIsDIynRLD eLSADAhVDR LItGRLTALN AFVSQtLtrq aeVraSrqlA kDKVNECVRS. 
FIPV aKVOdVVNtq gQALshLtvO LqgNnFgAISs SIsDIynRLD eLsSADAQVDR LItGRLTALN AFVSQtLtrq aeVraSrqlA kDKVNECVRS 
HCoV-0C43 aEALNnL1qQ LsNrFgAISa SLqEII1sRLD aLEAEAQIDR LInGRLTALN AYVSQqLsds tlvkfSaaqA mEKVNECVKS 
MHV aKmQfVVNan aEALNnL1nQ LsNrFgAISa SLQEI1sSRLD aLEAGAQIDR LInGRLTALN AYVSkqLsdm tlvkvSaagA iEKVNECVKS 
HEV vKIQaVVNan aEALNnL1qQ LsNrFgAISa SLqEI1sRLD aLEAkAQIDR LInGRLTALN AYVSQqLsds tlVkfSaaqA iEKVNECVKS 
BCoV vKIQaVVNan aEALNnL1qQ LsNrFgAISs SLqEI1sRLD aLEAGAQIDR LInGRLTALN AYVSOgLsds tlVkfSaagA mEKVNECVKS 
Consensus -KIQ-VVN-- -EALN-L--Q L-N-F-AIS- SL-EI--RLD -LEADAQIDR LI-GRLTALN AYVSQ-L--- --V--S---A -EKVNECVKS 


Fig. 2. (continued) 
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1251 


1340 


SARS-CoV QSkRvdFCGk GyHLmSfpqa APhGvvFLHv tyvPsqernf ttapaIChe. GE...ayfpR e...gvFvfin gtswfiTqrn fFsPqiiTtd 
HCoV-229E QSkRygFCGN GtHIfSIvna APeGLvFLHt vl1lPTqykdv eawsGLCv.. .DgtnGyv1R gpnlalyke. gnyYriTsri mFePripTma 
TGEV QSqRfgFCGN GtHL£SLana APnGMiFfHa vllPTaYeTv tawaGICald GDrtfGlvvK dvqltlFrnl ddkfylTprt mYqPrvaTss 
FIPV QSqRfgFCGN GtHLfSLana APnGMiFfHt vllPTaYeTv tawsGICasd GDrtfGlvvK dvqltlFrnl ddkfylTprt mYqPrvaTss 
HCoV-0C43 QSsRinFCGN GnHTiSLvqn APyGLyFIHf syvPTkYvTa rvspGLCia. GDr..GiapK s...gyFvnv nntWmyTgsg yYyPepiTen 
MHV QSsRinFCGN GnHI1SLvqn APyGLyFIHf syvPTsFtTa nvspGLCis. GDr..GlapK a...gyFvqd dgeWkfTgsn yYyPepiTdk 
HEV QSsRinFCGN GnHIiSLvqn APyGLyFIHf syvPTkYvTa kvspGLCia. GDi..GispK s...gyFinv nnswWmfTgsg yYyPepiTgqn 
BCoV QSsRinFCGN GnHIiSLvqn APyGLyFIH£ syvPTkYvTa kvspGLCia. GDr..GiapK s...gyFvnv nntWmfTgsg yYyPepiTgn 


Consensus QS-R--FCGN G-HI-SL--- AP-GL-FIH- ---PT-y-T- ----GLC--- GD---G---K ------F--- ---W--T--- -y-p---T-- 
1341 1430 
SARS-CoV ntfvsgnCdV vigiinntvy dplqPeldsf k...EELdky FKnhts..PD vdlgdisgiN asvvnIqkEi d......... ..... RLsev 


HCoV-229E dfVqienCnV tFvnisrseL qtivPeyiDv nkt1lQEL..s YKlpnytvPD LvveqY...N qTiLnLtsEi stlenksael nytvqKLqtl 
TGEV dfVqiegCdV 1FvnAtlsdL psiiPdyiDi ngtvQDIlen FR.pnwtvPE LtfdiF...N aTyLnLtgEi ddlefrsekl hnttveLail 
FIPV dfVqiegCdv 1lFvnAtvidL psiiPdyiDi nqtvQDIlen YR.pnwtvPE ftldiF...N aTyLnLtgEi ddlefrsekl hnttveLail 


HCoV-0C43 nvVvmstCaV nytkApyvmL ntsiPnlpDf k...EELdqw FKngtsvaPD Lsl.dY..iN vTf£LdLqvEm n......... ..... RLgea 
MHV nsVvmssCaa nYtkApevfL ntsiPnlpDf k...EELdkw FKngtsiaPD Lsl.dFekl1N vT1LdLtdEm n......... ..... Rigda 

HEV nvVvmstCaV nYtkApdlmL ntstPnlpDf k...EELyqw FKnqsslaPD Lsf.dY..iN vTfLdLqdEm n......... ..... RLqea 

BCoV nvVvmstCaV nYtkApdvmL nistPnlpDf k...EELdqw FKnqtsvaPD Lsl.dy..iN vIfLdbqdEm n......... ..... RLqea 
Consensus --V----C-V -Y--A----L ----P---D- ----EEL--- FK------ PD L----Y---N -T-L-L--E- ---------- -----RL--- 
1431 1526 


SARS-CoV aknLNeslId LqeLgkyEqyY IKWP, 


Wiha fiacliaivn viiliccnts Ccsclkgec sCoscCk ide ddscovikay iibvt 
HCoV-229E IdnINstlVd LkwLnrvEtYy IKWPWWVWLc IsvvlIfvvs ML11cCCsTG Ccgffscfas sirgCcCestk lpyyd.Veki hiq~~~ 


TGEV IdnINnt1VN LewLnriEtY VKWPWYVWLL IgLvvIfcIp LL1IFcCCsTG Ccgcigclgs cChsiCsrrq fenyepIekv hih~~~ 
FIPV IdnINnt1VN LewLnriEtY VKWPWYVWLL IgLvvVfcIp LL1FcCfsTG Ccgcigclgs cChsiCsrrq fenyepIekv hvh~~~ 
HCoV-0C43 IkvLNqsyIN LkdIgtyEyY VKWPWYVWLL IcLagVamLv LLf£FiCcCcTG Cgtscfk...kCggCcCddyt gyqelvI... ktshdd 
MHV IkkLNesyIN LkdvgtyEmY VKWPWYVWLL IgLagVavev LL£FiCCcTG Cgsccfk...kCgncCdecg ghqdsiVihn isshed 
HEV IkvLNhsyIN LkdIgtyEyY VKWPWYVWLL IcLagVvmLv LLEFiCCcTG Cgtscfk...kCggCfddyt ghgefvI... ktshdd 
BCoV IkvLNqsyIN LkdIgtyEyY VKWPWYVWLL IgLagVamLv LL£FiCCcTG Cgtscfk...kCggCCddyt ghqelvI... ktshdd 
Consensus I--LN---IN L--L---E-Y VKWPWYVWLL I-L--V--L- LL-F-CC-TG C---------- C--CC---- ------ Loo es>sSs 


Fig. 2. (continued) 


Biotinylation of recombinant S protein. Recombinant S protein was 
mixed with Sulfo-NHS-biotin (Pierce) in a ratio of ten to one. After a 
2-h-incubation on ice, the unincorporated biotin was removed by 
centricon-10 (Amicon) and the biotinylated S protein was stored at 
4°C until further analysis. Sulfo-NHS-biotin should be prepared 
freshly by dissolving in DDW. 

Western blot analysis. Proteins (5 wg) were separated by 10% SDS-— 
PAGE and the protein bands were then transferred electrophoretically 
to nitrocellulose membrane (Amersham Pharmacia Biotech). Mem- 
brane was blocked in blocking buffer (20mM Tris-HCl, pH 7.6, 
140mM NaCl, 0.1% Tween 20, and 5% skim milk) and probed with 
rabbit anti-spike (residue 511-993) polyclonal antibody or human anti- 
SARS sera at room temperature for 1h. The bound antibody was 
detected with horseradish peroxidase-conjugated secondary antibody 
(Sigma) followed by chemiluminescence (ECL system, Amersham) and 
exposed by X-ray films. Rabbit anti-spike (residue 511-993) polyclonal 
antibody was prepared by immunizing rabbit with truncated spike and 
was provided kindly by Dr. P.J. Chen. Human anti-SARS sera were 
collected from SARS patients and provided by Dr. L.K. Chen. 

For biotinylated Western blot, cell extracts were separated by 10% 
SDS-PAGE and the protein bands were then transferred to mem- 
brane. After blocking, the membrane was probed with 0.1 ug/ml bio- 
tinylated S protein at 4°C for 1 h. The bound protein was detected with 
peroxidase-conjugated avidin (Sigma) followed by chemiluminescence 
and exposed by X-ray films. 

Enzyme-linked immunosorbent assay (ELISA). Microtiter plates 
(Nunc) were coated at 4°C overnight with 50 ul of 10ng/pl proteins, 
which was diluted in 0.05 M carbonate buffer (pH 9.6). The wells were 
rinsed with 200 ul washing buffer (0.5% Tween 20 in PBS) and blocked 
with 2001 blocking buffer (5% bovine serum albumin (BSA) in 
washing buffer) by incubating at 37°C for 30min. The absorbed pro- 
tein in each well was challenged with 50 jul diluted human anti-SARS 
sera and incubated at 37°C for 1h. After three washes with washing 
buffer, 50 ul diluted peroxidase-conjugated secondary antibody was 
added to each well and incubated at 37°C for 1h. Following three 
washes, 50 ul chromogenic substrate, 2,2’-azinobis(3-ethylbenzthiazo- 


line-sulfonic acid), was added to each well and incubated at 37°C for 
15min. The absorbance was read at 405 nm in an ELISA plate reader. 

For biotinylated ELISA, the procedure was performed as described 
previously with modification. Briefly, microtiter plates were coated 
with Stg/well cell extract which was diluted in 0.05M carbonate 
buffer, blocked with blocking buffer, and challenged with 0.1 g/well 
biotinylated S protein. After a 1-h incubation at 37°C, diluted per- 
oxidase-conjugated avidin was added and incubated at 37°C for 1h. 
Following three washes, chromogenic substrate was added to each well 
and incubated at 37°C for 15 min. The absorbance was read at 405 nm 
in an ELISA plate reader. 


Results and discussion 


The SARS-CoV genome is approximately 29.7 kb 
long and contains five major ORFs flanked by 5’ and 3’- 
untranslated regions of 265 and 342 nucleotides, re- 
spectively (Fig. 1A) [5,6]. The predicted S gene is located 
from nucleotide 21,492 to 25,259 in the SARS-CoV 
genome. In order to clone S gene, the 3580-bp DNA 
fragment containing whole S gene was amplified by re- 
verse transcription-PCR and inserted into prokaryotic 
vector pET-28(+). The deduced amino acid sequence of 
S protein is shown in Fig. 2. S gene had an ORF of 3768 
nucleotides, capable of coding for a 1255-amino-acid 
polypeptide of about 138kDa. The amino terminus of 
SARS-CoV S protein contained a short type I signal 
peptide (residue 1-13) composed of hydrophobic amino 
acids that are presumably removed during cotransla- 
tional transport through ER. The carboxyl terminus 
(residue 1196-1255) consisted of a transmembrane 
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domain and a cytoplasmic tail rich in cysteine residues. 
Twenty-three potential N-linked glycosylation sites were 
predicted among SARS-CoV S protein. Together these 
data predicted that SARS-CoV S protein is a type I 
membrane glycoprotein with the N-terminus and the 
majority of the protein (residue 14-1195) on the outside 
of virus particle, in agreement with other coronavirus S 
protein data (Fig. 1B). 

Multiple alignment and phylogenetic analysis among 
S proteins of mammalian coronaviruses, which cause 
respiratory infection, were further performed by PileUp 
program of Genetics Computer Group (GCG) (Figs. 2 
and 3). Alignment of these sequences produced a low 
level of similarity (20-27% pairwise amino acid identity) 
between the predicted amino acid sequence of SARS- 
CoV S protein and other coronavirus S proteins. Phy- 
logenetic analysis showed that the species formed 
monophyletic clusters consistent with established taxo- 
nomic groups. However, SARS-CoV S protein se- 
quences segregated into a_ well-resolved branch, 
indicating that SARS-CoV S protein is not closely re- 
lated to any of the previously characterized coronavirus 
S proteins. Although overall sequence conservation was 
low, the C-terminus, consisting of a transmembrane 
domain and a cytoplasmic tail, was highly conserved. 
Putative cellular receptor-binding sites (residue 757-761) 
of SARS-CoV S protein [19] were not conserved com- 
pared with other coronaviruses. The antigenic sites of 
porcine transmissible gastroenteritis virus S protein 
([20,21] were also varied among these coronaviruses. 
These data suggested that comparison of primary amino 


acid sequences does not provide insight into the recep- 
tor-binding specificity or antigenic properties of SARS- 
CoV S protein. 

By analyzing the primary structure of SARS-CoV S 
protein, we tried to predict the putative antigenic regions 
of S protein. The hydrophilicity, surface probability, 
and chain flexibility of SARS-CoV S protein were cal- 
culated by PeptideStructure program of GCG according 
to Kyte and Doolittle plots, Emini prediction, and 
Karplus and Schulz prediction, respectively (Fig. 4A) 
[22-24]. The antigenicity of SARS-CoV S protein was 
further analyzed based on three aforementioned criteria, 
and the regions in which antigenic index exceeds 1.3 are 
shown in Fig. 4B. Most of the putative antigenic sites 
were located at outer membrane regions, in which resi- 
due 1-100 and 401-500 exhibited the highly antigenic 
potential. Because most T cell and B cell epitopes con- 
tain a sequence of 6—20-amino acids [25,26], we further 
analyzed the antigenicity using a window of 15 residues. 
Residue 12-50, 426-456, 478-494, 541-564, and 922- 
1118 of SARS-CoV S protein displayed the highly an- 
tigenic potential. These data suggested that these regions 
might be good candidates for developing SARS peptide 
vaccine. 

In order to study the antigenicity and receptor- 
binding ability of SARS-CoV S protein, we expressed 
and purified the full-length recombinant SARS-CoV S 
protein from E. coli. We expressed the S protein from 
E. coli BL21(DE3)pLysS strain transformed with a pET 
plasmid carrying S gene. After induction with IPTG, a 
product with 138-kDa was observed by SDS-PAGE 


Fig. 3. Phylogenetic analysis of coronavirus S proteins. Amino acid sequences of SARS-CoV S proteins were compared with those of HCoV-229E, 
TGEV, FIPV, HCoV-0C43, MHV, HEV, BCoV, avian infectious bronchitis virus (IBV) (GenBank Accession No. P11223), and turkey coronavirus 
(TCoV) (GenBank Accession No. AAQ20922). Sequence alignment and neighbor-joining tree were generated by GrowTree program of GCG using 
the BLOSUM comparison matrix. Coronavirus groups I, II, and III are indicated. Branch lengths are proportionate to amino acid differences. 
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Fig. 4. Analysis of the primary sequence of SARS-CoV S protein. (A) The hydrophilicity, surface probability, and chain flexibility of SARS-CoV S 
protein were analyzed by PeptideStructure program of GCG. (B) Antigenic analysis of SARS-CoV S protein. The secondary structure and putative 
antigenic sites of SARS-CoV S protein were analyzed by PeptideStructure program of GCG using the Chou-Fasman prediction. Secondary structure 
was drawn with the PloStructure program. The predicted antigenic sites are shown as octagons. The putative glycosylation sites are shown as 


balloons. 


analysis (Fig. 5). The amount of induced S protein was 
consistent when the bacteria were refreshed to 0.45—0.6 
ODg6o0 and the recombinant S protein was expressed in 
the soluble form in bacterial cells. The soluble S protein 
was further purified by affinity chromatography using 
His-Bond resin. No detectable S protein was purified, 
suggesting that the histidine tag is folded into the inte- 
rior of S protein (data not shown). We therefore dena- 


tured the S protein by urea, purified by nickel-affinity 
chromatography, and renatured the protein by dialysis. 
The amount of recombinant S protein recovered was 
approximately 0.2-0.3mg/100ml of bacterial culture 
(Fig. 6). 

To analyze the antigenicity of recombinant S protein, 
we performed Western blot and ELISA using sera from 
SARS patients or from spike-immunized rabbits. Fig. 7 


T.-Y. Ho et al. | Biochemical and Biophysical Research Communications 313 (2004) 938-947 945 


— =~ 


7— — a 
66 — 
sa 
uo 


Fig. 5. Optimal condition and distribution of recombinant SARS-CoV 
S protein in E. coli. (A) Optimal condition for expressing recombinant 
S protein. The E. coli BL21(DE3)pLysS strain containing pET-spike 
was inoculated at 37°C with shaking. After an overnight incubation, 
the bacteria were refreshed to 0.45 (lane 3), 0.5 (lane 4), 0.55 (lane 5) or 
0.6 ODeoo (lane 6) and the IPTG was then added to induce the protein 
expression. The preparations of uninduced (lane 2) and induced EF. coli 
(lanes 3-6) were analyzed by 10% SDS-PAGE and stained by 
Coomassie brilliant blue. (B) Distribution of recombinant S protein in 
E. coli. After the induction of IPTG, the total proteins (lane 3), soluble 
(lane 4), and insoluble fractions (lane 5) of induced E. coli were ana- 
lyzed by 10% SDS-PAGE and stained by Coomassie brilliant blue. 
Lane 2 represents the preparation of uninduced E. coli. The molecular 
masses of protein standard (lane 1) are shown at the left. The 138-kDa 
recombinant S protein is indicated by the arrow. 
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Fig. 6. SDS-PAGE analysis of recombinant SARS-CoV S protein. The 
total proteins (lane 3), soluble (lane 4), unbinding (lane 5), and purified 
fractions (lane 6) of induced E. coli were analyzed by 10% SDS-PAGE 
and stained by Coomassie brilliant blue. Lane 2 represents the prep- 
aration of uninduced E. coli. The molecular masses of protein standard 
(lane 1) are shown at the left. The 138-kDa recombinant S protein is 
indicated by the arrow. 


shows that recombinant S was detectable by both sera in 
both assays. Additionally, truncated S protein (residue 
511-993) exhibited the similar antigenicity compared 
with full-length spike (residue 1-1255). These results 
indicated that recombinant S protein remains its anti- 
genicity that could be recognized by sera from SARS 
patients. 

The receptor-binding ability of recombinant S pro- 
tein was analyzed by biotinylated ELISA and Western 
blot. Several reports indicated that SARS-CoV could 
have succeeded in growing progeny virus in Vero cells 
[3,5,6]. The BSA and Vero cell extracts were therefore 
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BSA Recombinant spike Recombinant spike 


(1-1255 a.a.) (511-993) 


Fig. 7. Antigenic analysis of recombinant SARS-CoV S protein. (A) 
Western blot analysis. The preparations of uninduced FE. coli (lanes 1 
and 4), induced E. coli (lanes 3 and 6), and purified S protein (lanes 2 
and 5) were analyzed by 10% SDS-PAGE, transferred to nitrocellulose 
membrane, detected by rabbit anti-spike antibody (lanes 1-3) or 
human anti-SARS-CoV antibody (lanes 4-6), and subsequently visu- 
alized by chemiluminescence. (B) ELISA. The BSA and recombinant S 
proteins (residue 1—-1255 and residue 511-993) were coated on ELISA 
plates and detected by human anti-SARS-CoV antibody. Values are 
means + standard error of triplicate assays. 
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Fig. 8. Receptor-binding ability of recombinant S protein. (A) Biotiny- 
lated ELISA. The BSA, Vero cell extract, and Vero E6 cell extract were 
coated on ELISA plates and detected by biotin-labeled S protein. Values 
are means+standard error with triplicate assays. (B) Biotinylated 
Western blot. The cell extracts of Vero E6 (lane 1), Vero (lane 2), A-549 
(lane 3), Chang liver (lane 4), and U-937 (lane 5) were separated by SDS— 
PAGE, transferred to membrane, and detected by biotin-labeled S 
protein. The molecular masses of protein standard are shown at the left. 
The proteins present in Vero cells only are indicated by arrowheads. 
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coated on ELISA plates and challenged with biotin- 
labeled S protein. The binding ability of S protein to cell 
extract was evaluated by OD4o; value in ELISA. Fig. 8A 
shows that the binding ability of S protein to BSA was 
very low. However, S protein significantly bound Vero 
cell extracts with the OD4os exceeding 0.9. These results 
proved the specificity of biotinylated ELISA in analyz- 
ing the receptor-binding ability of S protein. It also 
suggested that recombinant S protein could be served as 
a probe to analyze the cellular receptors involved in 
virus attachment. 

To further identify the potential cellular receptors for 
SARS-CoV attachment, we performed biotinylated 
Western blot. The cell extracts were separated by SDS— 
PAGE, transferred to membrane, and detected by bio- 
tin-labeled S protein. Fig. 8B shows that recombinant S 
protein interacted with several cellular proteins in dif- 
ferent cell types. By comparison of protein patterns of 
different cell types, we found that two proteins with 
molecular masses of 130 and 140 kDa were detectable in 
Vero cells instead of other cell types. It is now known 
that Vero cell is the only cell line that could be infected 
by SARS-CoV [3]. These results suggested that 130-kDa 
and 140-kDa polypeptides in Vero cells might be the 
cellular receptors responsible for SARS-CoV S protein 
binding. The elucidation of amino acid sequences of 
these proteins is now proceeded. 

In this study, we cloned, expressed, and purified the 
SARS-CoV S protein from E. coli. The recombinant S 
protein was expressed in soluble form in bacterial cells, 
and the amount of protein recovered was 0.2-0.3 mg/ 
100 ml bacterial culture. The S protein was recognized by 
sera from SARS patients, indicating that recombinant S 
protein retained its antigenicity. By biotinylated ELISA 
and Western blot using recombinant S protein as the 
probe, we identified that 130-kDa and 140-kDa proteins 
might be the cellular receptors responsible for SARS-CoV 
infection. These results suggested that S protein could bea 
good candidate for further developing SARS vaccine and 
anti-SARS therapy. 
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